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ABSTRACT 

Aims. We measure the clustering properties for a large samples of u- (z ~ 3), g- (z ~ 4), and r- (z ~ 5) dropouts from the Canada- 
France-Hawaii Telescope Legacy Survey (CFHTLS) Deep fields. 

Methods. Photometric redshift distributions along with simulations allow us to de-project the angular correlation measurements and 
estimate physical quantities such as the correlation length, halo mass, galaxy bias, and halo occupation as a function of UV luminosity. 
Results. For the first time we detect a significant one-halo term in the correlation function at z ~ 5. The comoving correlation lengths 
and halo masses of LBGs are found to decrease with decreasing rest-frame UV-luminosity. No significant redshift evolution is found 
in either quantity. The typical halo mass hosting an LBG is M ^ 10'"/i"'Mg and the halos are typically occupied by less than one 
galaxy. Clustering segregation with UV luminosity is clearly observed in the dropout samples, however redshift evolution cannot 
clearly be disentangled from systematic uncertainties introduced by the redshift distributions. We study a range of possible redshift 
distributions to illustrate the effect of this choice. Spectroscopy of representative subsamples is required to make high-accuracy 
absolute measurements of high-z halo masses. 
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1. Introduction 

More than a decade after its first applications the Lyman-break 
technique ( ,Steidel et al.|1996l Giavalisco,2002 ) is still the most 
widely used method to study galaxies in the high-redshift uni- 
verse. Recently, our knowledge of these early cosmic epochs 
has vastly increased providing an opportunity to test theories 
of galaxy formation. One crucial aspect of these models is the 
connection between the precisely-simulated dark matter (DM) 
structures and the properties of the galaxies forming inside of 
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them. Contemporary models predict that galaxy properties are 
tightly connected to the underlying dark matter halo hosting the 
galaxy, in particular the halo mass. 

N-body simulations of the DM distribution and its evolution 
predict the clustering properties of DM halos which are strongly 
coiTelated to the halo mass. Since galaxies are assumed to fol- 
low the DM, measuring the clustering properties of galaxies with 
different properties, e.g. different luminosities, allows us to link 
these properties to the mass of the underlying dark matter halos. 
However, galaxies are generally a biased tracer of the DM halos, 
so the link is indirect. 

Recently, several studies ( Ouchi et al.|2005 Lee et al.|2006 



|Hildebrandt e t al. 2007 ) have detected deviations from the large 
scale power-law behaviour on small scales in the coiTelation 
functions of Lyman-break galaxies (LBGs). These deviations are 
explained by the halo model (see e.g. Seljak|[2000 Cooray & 
|Sheth|[2002| ) as galaxy pairs in the same halo (one-halo term) 
allowing for a more accurate determination of the mean halo 
masses and also a measure of the mean number of galaxies 
hosted by a typical halo. 

Here, we present a precision study of clustering properties 
for three samples of LBGs at z ~ 3, z ~ 4, and z ~ 5. The sam- 
ple is coherently selected from one dataset, the CFHTLS Deep 
Survey, which controls for systematic errors that affect measure- 
ments of clustering evolution, such as the definition of selection 
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criteria, the role of different filter sets, masking, and the redshift- 
distributions. 

The data used for this study, the data reduction, and the cata- 
logue creation are presented in Sect. [2] The selection of the LBG 
samples together with their basic properties is covered in Sect. [3] 
In Sect. |4] we describe how we measure the correlation function 
of these galaxies and present the results of model fits to the data. 
These results are discussed and concluding remarks are given in 
Sect.E 

Throughout the paper we use AB magnitudes and we assume 
a ACDM concordance cosmological model (Q.^ = 0.7, Q.^ - 
0.3, (Tg = 0.9, Ho = 100/;-' h ^ 0.7). 

2. The data 

2.1. Overview and reduction 



For this work we consider publicly available and Elixir ( [Magnier 
& Cuillandre|2004 1 pre-processed data from the CFHTLS-Deep 
(ugriz-hands) and COSMOS survey (w-band). We retrieve from 
the CADC archive relevant images available at 21/07/2008. 

The data were observed with MEGACAM@C FHT and re- 
duced with the THELI imaging reduction pipeline ( jErben et"ar 
2005 ] l. The properties of the reduced images are summarised in 
Table [T] ( the limiting magnitudes reported in the table are 1 -cr 
limits in a circular aperture with a diameter of twice the seeing 
FWHM). The data reduction and catalogue creation are carried 
out identically to the one of the CFHTLS-Wide survey described 
in Erben et al. ( 2009| l. Here, we quickly summarise the most im- 
portant reduction- and catalogue creation steps: 

1. Retrieval of all available pre-reduced exposures from the 
CADC archive. 

2. Quality control of all 36 chips of each retrieved exposure to 
identify bad ones (e.g. chips with a large fraction of saturated 
pixels, etc.). 

3. Identification of satellite tracks with Hough transform tech- 
niques (Vandame^OOl). 

4. Creation of weight images for each chip including bad pixels 
and the satellite track masks. 



5. Extraction of catalogues with SExtractor (Bertin & Arnouts 



1 1996 ) for astrometric calibration, 
calibration 



6. Astrometric calibration of the ;-band images with the 
Astrometrix (Ra dovich ,2002[ [McCracken et al.||2003|l soft- 
ware and the USNO-B 1 astrometric catalogue ( Monet et al. 
|2003| as a reference. After the /-band image has been stacked 
(step 8), the other images are calibrated astrometrically with 
a catalogue extracted from that co-added /-band image as a 
reference. This is done to ensure an optimal alignment of the 
images in the different bands. 

7. Relative photometric calibration with the Photometrix soft- 
ware and absolute photometric calibration with the methods 
described in Hilde faandt et al.| ( |2006| l. 

8. Co-addition with the SWarp software ( |Bertin|2003l ). 

9. PSF equalisation by degrading all bands of one particular 
pointing to the seeing of the image with the worst seeing. 
This is done by convolution with appropriate Gaussian ker- 
nels. 

Creation of limiting magnitude maps from the noise proper- 
ties of the co-added images. 

SExtractor runs in dual-image mode with the unconvolved 
(i.e. not degraded to a worse seeing) /-band image as the de- 
tection image and the convolved images in all five bands as 
measuring images to extract a multi-colour catalogue. 
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12. Automated masking of the image stacks to mask out low-S/N 
regions and regions affected by stellar diffraction spikes and 
stellar light halos as well as asteroid tracks. 



2.2. Piiotometric redsliifts 



In contrast to the method applied in Erben et al. ( 2009 1 we mod- 
ify the estimation of photometric redshifts (photo-z) in the fol- 
lowing points: 



Galactic extinction is included on an object basis from the 
beginning. In Erben et al. (20091 we relied on the average 



removal of extinction effects by means of zeropoint (ZP) re- 
calibration on fields with spectroscopic coverage. This step 
decreases the scatter of the photo-z's slightly. 
We substitute the template set (the four empirical templates 
from Coleman et al.^ 1980 plus two starburst templates from 
Kinney et al.|1996 1 by a re-calibrated version of these tem- 
plates f rom|Capak (|2004|l. Th e same template set was also 



used by Mobasher et al. ( 2007 1 and is based on the technique 
described in Budavari et al. (1999). The new template set 
has an impact on the accuracy of the photo-z's at higher red- 
shift. The systematic underestimation of the redshifts, seen 
in [Erben et al] ( 12009 1 on the CFHTLS-Wide data, vanishes. 
We re-run the ZP re-calibration matching the colours of ob- 
jects with spectroscopic redshifts with the colours of the 
best-fit template at that particular redshift (see Erben et al. 
|2009| for details of the procedure). For the D4 field, which 
does not have any spectroscopic overlap, we apply the ZP 
offsets found on the Dl field, which has the largest overlap 
with a spectroscopic catalogue and thus potentially the most 
accurate re-calibration. 

The priors used in the BPZ code ( Benitez 2000[ l are modified 
to increase the probability of objects being assigned a low 
redshift [z ^ 0.2). In this way we are also able to remove the 
systematic overestimation of redshifts for z ^ 0.2 as reported 
in |ErbenetaT] ( |2009l l. 



By these measures a tilt in the photo-z vs. spec-z comparison, 
as reported in Erben et al. ( 2009 1 can be avoided. 



With thousands of pubUc spectroscopic redshifts from the 
VIMOS VL T Deep Survey (VVDS) on the Dl field (L e Fevre 
ler aL||2005] l, the zCOSMOS survey on the D2 fiel d (.Lilly et al. 
20071, and the DEEP2 survey on the D3 field ( [Davis et aL 
,2007 1, we test the performance of our photo-z's. For a safe 
sample (ODDS > 0.9; see al so pemtez 20C^ [Hildebrandt 
|et al.|[2008] [Erben et aL][2009| l in the magnitude range 17 < 
/ < 24 we find a scatter of cr = 0.033 of the quantity Az = 
(zphot - Zspec) / (l + Zspec) after rejecting 1.6% of outUers (objects 
with Az > 0.15) and no significant bias. Figure [T] shows a com- 
parison of the spectroscopic redshifts with our photo-z's on the 
three fields. Note that these accurate low-redshift photo-z's for 
relatively bright objects do not imply directly a similar photo- 
z performance at higher redshifts and fainter magnitudes. This 
results should be regarded more as a quality check for the data 
reduction and the multi-colour photometry. 

Furthermore, we also carried out a cross-correlation analy- 
sis of galaxies in photo-z bins. The results can be found in the 
appendix [A| 
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Fig. 1. Comparison of the BPZ photo-z's to spectroscopic red- 
shifts from the VVDS, zCOSMOS, and DEEP2 (2291, 232, and 
2705 objects with good spectroscopic flags and ODDS > 0.9 in 
the Dl, D2, and D3 fields, respectively). 



3. The LBG samples 

3.1. Simulations 

We simulate a colour catalogue of galaxies based on templates 
from the library of |Bruzual & Chariot] (|1993[ ) - in the same 
way as presented in [Hildebrandt et al .1 ( 12007 1 for the ESO Deep 
Pubhc Survey (DPS): 



- We take the observed /-band number counts as a reference 
for creating the simulated colour catalogue. 

- i-band magnitude-dependent and spectral-type-dependent 
redshift distributions are extracted from the BPZ code. These 
are based on catalogues extracted from the Hubble Deep 
Field North ( |Fernandez-Soto et al.l 1999| l and the Canada- 
France Redshift Survey (Lilly et a l.|1995| . 

- A huge multi-colour catalogue is simulated with the Hyperz 
([Bolzonella et al. 2000) photo-z code evenly distributed in 
!-band magnitude, redshift, and spectral-type (eight diff'erent 



star-formation-histories applied to the library of Bruzual & 
|Charlot|1993] l. 

- From this huge homogeneous catalogue we randomly pick 
objects for a given magnitude, type, and redshift bin. The 
numbers in a particular bin are scaled by the /-band num- 



ber counts and the magnitude- and type-dependent redshift 
distributions mentioned above. 

The colour catalogue simulated in this way shows very sim- 
ilar colour-colour diagrams as the observed catalogue (see also 
Hildebrandt et al._2007) and is supposed to represent a realistic 
mix of galaxy types and low- and high-redshift objects. 

In order to study the eff'ect of the choice of the template 



set we re-run the simul ations with the templates from |Maraston 
et al. (2006) (see also Maraston'"2005) as a basisQln general, 
the results are very similar to the ones with the template set from 
Bruzual & Chariot ( 1993 1. The scientific results in this study are 
hardly affected by this choice. Nevertheless, we show results for 
both template sets in the remainder of the paper 

Furthermore, we simulate the colours of stars in our fields 
from the TRILEGAL galactic model ( |Girardi et al.|2005| ) taking 
into account the galactic coordinates of our survey fields and the 
depths of the images (see also Hildebrandt et al.|2007| l. 



3.2. Selection 

From these simulations we identify regions in colour-space 
where the efficiency of finding high-redshift star-forming galax- 
ies is high due to their distinctive colours produced by the 
Lyman-break and where the contamination from low-redshift in- 
terlopers and stars is low. We do this in the ugr, gri, and riz 
colour-spaces to select M-dropouts, g-dropouts, and r-dropouts, 
respectively. In Fig.|2]the fraction of objects in the desired red- 
shift range (M-dropouts: 2 < z < 4; g-dropouts: 3 < z < 5; 
r-dropouts: 4 < z < 6) is plotted as a function of colour. In order 
to steer away from the stellar locus, not the whole high-efficiency 
area in the u - g vs. g - r colour-space can be used. 
Our colour selection criteria are: 

- for the M-dropouts: 1.5 < {u- g) A -1 .0 < (g - r) < 1 .2 A 
1.5 ■ig-r)<iii-g) -0.75, 

- for the g-dropouts: 1.0 < (g - r) A -1.0 < (r - /) < 1.0 A 
1.5-(r-0<(g-r)-0.8, 

- for the r-dropouts: 1.2 < (r - /) A -1.0 < (/ - z) < 0.7 A 
1.5-(/-z)<(r-/)-1.0. 

Furthermore, we require all LBGs to have a SExtractor 
CLASS-STAR parameter of CLASS_STAR < 0.90 that g- 
dropouts are not detected in m, and that r-dropouts are neither 
detected in m nor in g. In this way we select 34218 M-dropouts, 
36 226 g-dropouts, and 10482 r-dropouts from the effective area 
of 3.17 sq. deg., i.e. the area after masking (see |Erben et aL 



2009 for a detailed description of the masking algorithms). 



' These templates can be retrieved from http : / /www . i eg . port . 
ac.uk/~inaraston/hyperz- templates/1 

- This cut on the compactness of the objects assures a rejection of 
most stars. The image quality of our /-band images is very high with 
a seeing FWHM:S 0'.'7. Thus we can still separate reliably most high-z 
galaxies from stars. 
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Fig. 2. Colour-colour diagrams showing the fraction of objects at the target redshifts {left: 2 < z < 4; middle: 3 < z < 5; right: 
4 < z < 6) as a function of colour in our simulations based on the templates from Bruzual & Chariot ( 1993 1, with black areas 
corresponding to 100% and white to 0% selection efficiency. The solid lines represent the selection criteria adopted in this study. 




g-r 



Fig. 3. Same as Fig.jijbut based on the templates from 
make any difference here. 



Maraston et al. 



( 2006 \ showing that the choice of the template set does not 



3.3. Properties 

The number counts of the three samples are plotted in Fig. [4] 
They are in good agreement with previous studies, but the supe- 
rior quality of the CFHTLS-Deep data for this kind of analysis 
is clearly visible, most dramatically for the r-dropout sample. 

The number counts suggest approximate completeness limits 
of r = 25.5 for the / = 26.0 for the g-, and z = 26.5 for the 
r-dropouts. These completeness limits should not be regarded 
as robust. We only choose them by eye. Extensive simulations 
are prepared to quantify reliably the completeness as a function 
of magnitude. These are important for estimates of luminosity 
functions but are beyond the scope of this study, especially since 
the clustering measurements are not affected strongly by some 
low amounts of incompleteness. 

From the simulated catalogues described above we find that 
the contamination from stars and low-z interlopers can be kept 
below the 10% level in each magnitude bin (bin-size Amag - 
0.5) by applying a bright cutoff of r > 23 for the / > 23.5 for 
the g-, and z > 24 for the r-dropouts (for a discussion on how 
to avoid contaminants in an iJ-dropout survey by a bright cutoff 
see also Lehnert & Bremer| |2003| l. These magnitude cuts de- 
fine our high-quality samples which consist of 17 338 u-, 17 281 
g-, and 7038 r-dropouts. In the following, only LBGs in these 
magnitude ranges are considered. 



3.4. Redsinift distributions 

The redshift distributions of the LBG samples are of crucial im- 
portance in the physical interpretation of the angular clustering 
results presented below. There is virtually no overlap between 
the secure spectroscopic samples of the VVDS, zCOSMOS, and 
DEEP2 and our high-quality dropout samples. Nearly all LBG 
candidates are too faint and the brighter ones do not have good 
spectroscopic measurements as indicated by their spectroscopic 
flags. Here, we present four different ways to estimate the red- 
shift distribution by other means. All four sets of redshifts distri- 
butions, which are displayed in Fig. |5] will be used in Sect. 4.2 
to interpret the observed clustering. 



3.4.1. Photo-z distributions from BPZ 

The photometric redshift distributions of the selected objects, 
estimated with BPZ and filtered for a BPZ ODDS parameter of 
ODDS > 0.9, show clear peaks at the targeted redshifts. The 
means and RMS-widths of these peaks are z = 3.28 + 0.15, z = 
3.87 + 0.32, and z = 4.74 + 0.14. While the mean redshifts agree 
well with expectations, the widths of the distributions for the u- 
and the r- dropouts seem too small. This may well be an effect of 
the aggressive prior used by BPZ or the empirical template set, 
which is based on low-z observations. 
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Fig. 4. Number counts for the three dropout samples (M-dropouts left, g-dropouts middle, and r-dropouts right) in the bands corre- 
sponding to approximately the same rest-frame UV-wavelength. We compare our M-dropout number counts to the ones from Steidel] 
et al. (1999 1 and Hildebrandt et al. (2007) clearly showing the greater depth of the CFHTLS-Deep data. The g-dropout number 
counts are compared to [Steidel et al. ( 1999[ l being consistent for ; < 24.5 and suggesting some incompleteness of the older survey 
for fainter magnitudes. The 13 7?-dropouts reported in |Lehnert & Brerner| ( |2003| show a similar number density as the r-dropouts in 
the current study. 



If we run BPZ without the prior, objects with a low ODDS 
parameter that were assigned a low redshift before are assigned 
a high redshift. This is exactly the behaviour one expects from 
double peaked redshift probability functions. For fixed ODDS 
the width of the z-distributions does not change from the prior- to 
the non-prior-setup. If the prior is switched off and additionally 
the cut on ODDS > 0.9 is dropped the z-distributions change 
only slightly. 

It is the width of the redshift distribution rather than the mean 
redshift that is important for the interpretation of the cluster- 
ing measurements (correlation lengths, galaxy bias, halo masses, 
and halo occupation numbers) described below. 



3.4.2. Photo-z distributions from Hyperz 

An independent check of the photo-z's with Hyperz and the tem- 
plate set based on the library of Bruzual & Chariot ( 1993) yields 
the following means and RMS-widths: z - 2.93 ± 0.24 for the 
U-, z = 3.67 ± 0.31 for the g-, and z = 4.47 ± 0.22 for the r- 
dropouts. The distributions are shown in Fig.|5]as well. While 
the Hyperz RMS widths seem more realistic, the mean redshift 
values are slightly lower than the BPZ ones above and also lower 
than the theoretically expected ones. In |Hildebrandt et aT ( 2008| l 
we found that for a magnitude limited galaxy sample the photo- 
z's for this particular combination of code and template set and 
estimated from a similar filter set iUBVRI) are biased low at 
the ~ 5% level when compared to spectroscopic redshifts of the 
VVDS. This is most probably due to a slight mismatch of the 
absolute spectrophotometric calibration of the templates and the 
absolute photometric calibration of the data. Correcting for this 
bias would make the mean redshifts of the g- and r-dropouts 
very compatible with BPZ leaving only the w-dropouts slightly 
low, although it is questionable whether this bias will remain 
constant out to much higher redshifts than probed by the VVDS. 



cally. Thus, we regard the Hyperz redshift distributions as the 
least reliable ones. 



3.4.3. Redsliift distributions from simulations 

The redshift distributions which are derived from the simulated 
colour catalogues are show n in Fig. [5] as w ell. The ones based 
on the templates from Bruzual & Chariot ( 1993 1 suggest mean 
redshifts and RMS-widths of z = 3.16 + 0.22 for the u-, z = 
3.76 + 0.30 for the g-, and z ^ 4.73 + 0.24 for the r-dropouts. 



The multi-modality of the photometric redshift distributions 
hints at problems in the photo-z estimation. No such multi- 
modalities are observed in spectroscopic surveys of LBGs (see 
e.g. [Steidel et al.||1999] |2003| l nor are they expected theoreti- 



The simulations based on Maraston et al. (2006 1 templates yield 
means and widths of z = 3.30 + 0.24, z = 3.83 ± 0.32, and 
z = 4.77 ± 0.24 for the u-, g-, and r-dropouts, respectively. 

In the following we will refer to the four sets of red- 
shift distributions as the BPZ, the Hyperz, the BCsim, and the 
Maraston_sim distributions. 

By choosing the r-band as the reference magnitude for the 
M-dropouts, the /-band for the ^-dropouts, and the z-band for the 
r-dropouts we select galaxies at similar UV rest-frame wave- 
lengths (mean rest-frame wavelength of the filters is ~ 1560A for 
BCsim) so that the A:-correction between the different samples is 
negligible. The distance modulus between the mean redshifts of 
the u- and the g-dropouts and between the mean redshifts of the 
g- and the r-dropouts is ~ 0.39mag and ~ 0.65mag, respectively 
(again for BC^im). 



4. Clustering analysis 

4.1. Technique 

We estimate the angular correlation function cl)(6) of the different 
flux-limited subsamples by applying the estimator of |Landy"& 
Szalay (1993). Errors are estimated from the field-to-field vari- 
ance of the four fields. We choose a common angular binning 
for all samples but we check the influence of this choice (see 
below). The same masks as for the object detection are used for 
creating the random catalogue, which is necessary to estimate 
the correlation function. See Fig. 6]for the correlation functions 
of the different magnitude-limited LBG samples. 
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Hyperz photo-z 



Bruzual & Chariot (1993] (si 





Maraston et al. (2006) (sim) 




Fig. 5. Redshift distributions (arbitrarily normalised) of the three dropout samples (w-dropouts solid, g-dropouts dotted, and r- 
dropouts dashed). The first panel shows the BPZ photo-z distributions, the second panel shows the Hyperz photo-z distributions, the 
third panel shows the distributions derived from the simulated colour catalogue based on Bruzual & Chariot (1993 ) templates, and 
the last panel shows the distributions derived from the simulated colour catalogue based on , Maraston et al., ( |2006| ) templates. 



We fit a power-law, (jj(6) = A 6^^, to the angular correla- 
tion functions in the angular region O'l < < 10'. A de- 
projection with Limber's equatioq^(Limber 1953| involving the 
redshift distributions shown above (see of Fig. |5]l yields the 
real-space correlation functions approximated by a power-law, 

^(r) = (^) ''' , with ro being the comoving correlation length and 
y = 6 + I being the slope of the correlation function. We correct 
iteratively for the integral constraint bias with the method ex- 
plained in Adelberger et al. ( 2005| l. In this way we also estimate 
values for the large-scalajgalaxy bias factor, bga\' 



(1) 



The galaxy fluctuations in spheres of radius 8/i"'Mpc can be es- 
timated from the power law fit to the correlation function in the 
following way ( Peebles|1980| l: 



72(ro/8riMpc)'' 

(3-r)(4-r)(6-r)2^ 



(2) 



whereas the corresponding DM fluctuations at redshift z, cr8(z), 
are calculated from theory. 

4.2. Results 

The results are summarised in Tables [2]|5] and the dependence 
of different measured quantities on redshift, magnitude, and the 
assumed redshift distribution is visualised in Fig. |7] The errors 
for ro and bg.^j in Fig. l7| are estimated in Monte Carlo realisa- 
tions from the errors or the power-law fit. Therein, we assume 
uncoiTelated Gaussian eiTors of the amplitude and the slope of 
the power-law. In the lower S/N domain the error introduced by 
the binning of the data becomes dominant over the fitting error. 
We add this contribution in quadrature, which is estimated from 
the scatter for many diff'erent binnings. 

There is clear evidence for clustering segregation with rest- 
frame UV luminosity for all three dropout samples regardless of 
the assumed redshift distribution. Brighter samples cluster more 
strongly than fainter ones as reflected by their correlation lengths 
and bias values. 



See 



Simon 



e discussion of the accuracy of 
Limber's equation in this kind of applications. The redshift distributions 
of the LBGs are wide enough to not bias our clustering measurements 
significantly when applying Limber's equation. 

The fitting ranges correspond to comoving scales of > 130/i"'kpc. 



The observation that the correlation function slope on inter- 
mediate and large scales gets shallower for fainter galaxies as 
reported by Ouchi et al. (2005), Kashika wa et al.| ( |2006| ), and 
Hil debrandt et al.| (i2007 ) is also detected in our u- and g-dropout 
samples. Furthermore, we also detect a similar levelling-off of 
the slope for the two faintest bins as reported by Hildebrandt 
et al. ( 2007) for a f/-dropout sample. The flattening of the cor- 



relation function for fainter magnitudes is not observed for the 
r-dropouts. They show a constant, rather steep slope of y » 2.1 
for aU magnitude limits. 

The measurement of correlation lengths is strongly de- 
pendent on the assumed redshift distributions which have not 
been spectroscopically verified. However, the M-dropout samples 
show smaller correlation lengths than the higher redshift samples 
for the BPZ, the BCsim, and the Marastonsim redshift distribu- 
tions, indicating some evolution may be occuiTing. 

A much clearer redshift dependence is observed for the val- 
ues of the large-scale galaxy bias again for the BPZ, the BCsim, 
and the Maraston _sim redshift distributions. From z ~ 5 to z ~ 3 
the galaxy bias decreases by a factor of two. This more pro- 
nounced trend in comparison to the correlation lengths can eas- 
ily be explained by the ongoing structure formation in the early 
universe. DM halos with roughly the same comoving correla- 
tion length are more biased tracers of the underlying dark matter 
field at high redshifts than at lower redshifts. This is due to the 
fact that the amplitudes of the overall DM density field grow 
with time. Thus, the scale where the correlation function takes a 
value of one (the correlation lengths) increases over time. 



In Hildebrandt et al. (2007) we found clustering segregation 



with UV luminosity as well but the correlation lengths of that 
{/-dropout sample were consistent with the ones from the B- 
dropout sample from Ouchi et al. ( 20051 for the same luminosity, 
showing no evolution with redshift. Besides the much larger sta- 
tistical power of the present survey, it should be noted that sev- 
eral systematic effects can influence such a comparison of dif- 
ferent datasets. For example, it is not clear whether intrinsically 
very similar populations were studied in Ouchi et al. ( 2005 1 and 
Hildebr andt et al.l(2007 ) because of the different filter sets, the 
different selection criteria, and the different depths of the data. 
Furthermore, masking can introduce biases in the clustering sig- 
nal that are hard to control. Most importantly (see above), the 
redshift distributions were obtained in very different ways with 
Hildebrandt et al. (2007 ) relying on Hyperz photo-z' s and Ouchi 
|et al.| ( |2005| l on simulations. Indeed, the non-evolution of the 
correlation length with redshift from z ~ 4 to z ~ 3 is seen also 
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Fig. 6. Angular correlation functions for the LBG samples of the three dropout populations (M-dropouts top, g-dropouts middle, and 
r-dropouts bottom) for different limiting magnitudes (corrected for the integral constraint bias with the BCsim redshift distribu- 
tions). The errors are estimated from field-to-field variance over the four CFHTLS-Deep fields. The solid lines represent power-law 
fits to the data in the angular region Of 1 < < 10'. For all samples we find a significant deviation on small scales from this power-law 
behaviour on larger scales. 



here in the Hyperz panel in Fig.jT] It might well be spurious due 
to the problems mentioned in Sect. |3.4.2 



In the present study we carry out a coherent analysis on 
one dataset. The selection criteria are taken from the same set 
of simulations and the masking is identical for all three sam- 
ples. However, without spectroscopic redshift distributions we 
cannot decide whether the evolutionary trend observed with the 
BPZ, the BCjiim, and the Mamston_sim redshift distributions 
or the non-evolution observed for the Hyperz redshift distribu- 
tion is real. The simulated redshift distributions have the advan- 



tage that they are not affected by a number of systematic errors 
inherent to the observations (except for the reliance on the ob- 
served /-band number counts) or spurious effects introduced by 
priors in the photo-z codes. Furthermore, the absolute results 
from the BCsim case and the Maraston^sim case agree very 
well, although different template sets were used. One disadvan- 
tage of the simulations is that both rely on assumptions about the 
mixture of templates and the fractions of high- to low-z objects 
which are taken from external data (the HDF-N and the CFRS). 
We tend to trust the results based on the simulated redshift distri- 
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Table 2. Clustering measurements of the three dropout samples 
for different flux limits using the BPZ redshift distributions. 









M-QIUpUUlS, Z 


— ^ 9S 

Tiean -'.^u 










/j„ 




J 


To 


''gal 


IC 










[/!-'Mpc] 








24.5 


5.45 X 10-" ± 5.55 x 10" 


-5 


1.85 ±0.06 


5.03 ± 0.35 


3.39 ± 


0.22 


0.0056 


25.0 


1.58 X 10-3 ± 1.59 X 10" 


-4 


1.67 ±0.03 


3.47 ±0.13 


2.41 ± 


0.08 


0.0028 


25.5 


3.43 X 10-3 ± 3.44 x 10^ 


-4 


1.60 ±0.02 


2.76 ± 0.08 


2.01 ± 


0.05 


0.0020 


him 






g-dropouts; z 


Tiean 3.87 








25.0 


5.20 X 10-" ± 5.28 X 10" 


-5 


1.95 ±0.08 


5.43 ± 0.39 


4.24 ± 


0.28 


0.0021 


25.5 


1.32 X 10-3 ± 1.33 X 10- 


-4 


1.68 ±0.08 


4.64 ± 0.27 


3.49 ± 


0.17 


0.0014 


26.0 


2.67 X 10-3 ± 2.68 x 10" 


-4 


1.71 ±0.05 


3.69 ±0.13 


2.87 ± 


0.08 


0.0010 


ZUm 


r-dropouts; Zmr^n = 4.74 


25.5 


4.96 X 10-" ±5.11 X 10- 


-5 


2.07 ± 0.07 


5.22 ±0.38 


5.00 ± 


0.37 


0.0142 


26.0 


1.18 X 10-3 ± 1.19 X 10- 


-4 


2.10 ±0.07 


4.07 ±0.18 


3.90 ± 


0.19 


0.0086 


26.5 


2.08 X 10-3 ± 2.09 X 10" 


-4 


2.08 ± 0.04 


3.45 ±0.10 


3.25 ± 


0.09 


0.0060 



Table 3. Same as Table |2| but for the Hyperz redshift distribu- 
tions. 









M-dropouts; z 


= 2 93 










n„ 




7 




h 


!al 


IC 




[/j3Mpc-3] 






[/j-'Mpc] 








24.5 


5.19 X 10-" ± 5.29 X 10- 




1.88 ±0.07 


5.94 ±0.42 


3.66 = 


t0.15 


0.0025 


25.0 


1.50 X 10-3 ± 1.51 X 10" 


-4 


1.68 ±0.03 


4.25 ±0.13 


2.63 = 


t0.06 


0.0013 


25.5 


3.27 X 10-3 ± 3.28 x 10^ 


-4 


1.62 ±0.02 


3.43 ± 0.08 


2.20 = 


t0.02 


0.0009 


him 






g-dropouts; z 


Tiean 3.67 








25.0 


5.05 X 10-" ± 5.13 X 10- 


-5 


1.95 ±0.08 


5.64 ±0.38 


4.22 = 


tO.26 


0.0021 


25.5 


1.29 X 10-3 ± j 29x 10" 


-4 


1.68 ±0.08 


4.83 ±0.30 


3.46 = 


t0.17 


0.0014 


26.0 


2.60 X 10-3 + 2.61 X 10- 


-4 


1.71 ±0.05 


3.83 ±0.15 


2.85 = 


tO.lO 


0.0010 




r-dropouts; Zmsm = 4.47 


25.5 


9.80 X 10-" ± 1.01 X 10 


-4 


2.16 ±0.08 


3.66 ± 0.22 


3.36 = 


tO.23 


0.0031 


26.0 


2.33 X 10-3 ±2.36 X 10" 


-4 


2.17 ±0.07 


2.90 ± 0.67 


2.61 = 


tO.62 


0.0018 


26.5 


4.10 X 10-3 ±4.13 X 10- 


-4 


2.13 ±0.05 


2.46 ± 0.07 


2.18 = 


t0.08 


0.0013 



Table 4. Same as Table ID but for the BC-sim redshift distribu- 
tions including halo model estimates. 





K-drOpOUtS; Zn,eaii =3.16 




;jg 7 ro b^^, IC log^Mh^ao) 
[/!3Mpc-3] [/7-'Mpc] [A-'Mo] 






24.5 
25.0 
25.5 


4.17 X 10-" ±4.24 X 10-5 1.87 ±0.07 6.16 ±0.43 4.00 ± 0.26 0.0036 12.68+|J]? 0.32 ± 0.27 
1.21 X 10-3 ± 1.21 X 10-" 1.68 ±0.03 4.39 ±0.17 2.86 ±0.09 0.0018 12.26+!? '^ 0.70 ±1.02 
2.62 X 10-3 ±2.63 X 10-" 1.61 ±0.02 3.54 ±0.10 2.39 ± 0.05 0.0013 12.06!jj ;3 0.84 ±0.66 


him 


g-dropouts; Zm^an = 3.76 


25.0 
25.5 
26.0 


3.41 X 10-" ± 3.46 X IO-' 1.94 ±0.08 6.02 ± 0.43 4.57 ±0.36 0.0028 12.39+U |!; 0.45 ± 0.33 
8.68 X 10-" ± 8.73 X 10-5 1.68 ±0.08 5.24 ±0.30 3.78 ±0.19 0.0019 12.17+0 * 0.65 ± 0.57 
1.76 X 10-3 ± 1.76 X 10-" 1.70 ±0.04 4.16 ±0.16 3.12 ±0.10 0.0013 I2m+'^°l 0.48 ±0.13 


Z\im 


r-dropouts; z„,ean = 4.73 


25.5 
26.0 
26.5 


2.27 X 10-" ±2.34 X 10-5 2.10 ±0.07 7.10 ±0.53 6.95 ± 0.57 0.0108 12.26+|; |:^ 0.30 ±0.18 
5.40 X 10-" ± 5.46 X 10-5 2.13 ±0.07 5.53 ±0.26 5.41 ±0.29 0.0066 12.08+" '^ 0.39 ± 0.26 
9.49 X 10-" ± 9.56 X 10-5 2.09 ± 0.04 4.71 ±0.17 4.52 ±0.17 0.0046 12.00+j; |3 0.16 ±0.20 



butions more, but note that a decision about redshift evolution or 
non-evolution cannot be taken without additional massive spec- 
troscopic support. 

The same is true for the influence of contamination or in- 
completeness on the clustering measurements. Without a deep 
spectroscopic survey we cannot account for such effects because 
we do not know how these possible low-z contaminants cluster 
or which galaxies at high-z are missed by the Lyman-break se- 
lection. This is a fundamental limitation of a photometric survey. 



However, our simulations suggest - and the colour selection cri- 
teria and magnitude cuts were chosen in such a way - that con- 
tamination and incompleteness are kept low. Thus, we still think 
it is reasonable to assume that the clustering measurements are 
not seriously affected by either of the two effects. 
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Table 5. Same as Table E]but for the Marastonjim redshift dis- 
tributions. 









M-dropouts; z 


= 3 30 














1 




h 


gal 


IC 










[/i-'Mpc] 








24.5 


3.86 X 10-^ ± 3.93 x 10" 


-S 


1.87 ±0.07 


6.19 ±0.40 


4.14 


±0.26 


0.0035 


25.0 


1.12 X 10-3 + ^ j2x 10- 


-4 


1.68 ±0.03 


4.41 ±0.15 


2.96 


±0.09 


0.0018 


25.5 


2.43 X 10-3 ± 2.43 x 10" 


-4 


1.61 ±0.02 


3.55 ±0.10 


2.48 


±0.05 


0.0012 


him 






g-dropouts; z 


Tiean 3.83 








25.0 


3.10 X 10-''±3.15x 10 


-5 


1.94 ±0.08 


6.26 ± 0.46 


4.82 


±0.33 


0.0028 


25.5 


7.89 X 10-'' ± 7.94 x 10" 


-5 


1.68 ±0.08 


5.44 ± 0.28 


3.95 


±0.17 


0.0019 


26.0 


1.60 X 10-3 ± 1.60 X 10 


-4 


1.70 ±0.04 


4.32 ±0.17 


3.26 


±0.11 


0.0013 




r-dropouts; Zms-^n = 4.77 


25.5 


2.12 X 10-''±2.19x 10^ 


-^ 


2.09 ± 0.07 


7.09 ± 0.54 


6.98 


±0.56 


0.0110 


26.0 


5.05 X 10-* ± 5.11 X 10- 


-5 


2.12 ±0.07 


5.51 ±0.27 


5.43 


±0.36 


0.0067 


26.5 


8.88 X 10-* ± 8.94 x 10" 


-5 


2.09 ± 0.04 


4.70 ±0.16 


4.54 


±0.16 


0.0046 



I I I I I I I I I I I I I I — I I I I I 

— «— u -dropouts 
- - g-dropouls 
r-dropouts 



Uyperz 

I I I I I I I I I I I I I — I I I I I I 




Marasionsim 

I I I I — I I I I I I I I I 



" 6 

I 

6 
a 



o 4 - 





I I I I I I I I I I I I I I I 




I I I I I I I I I I I I I I I 



2.2 - 



1.8 
1.6 







1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



8 - 



6 — 



2 - 







I .... I 



I . . .Tl 



I .... I 



I . . .Tl 



I .... I 



I . . .Tl 



I .... I 



I I I I I I I I 



25 25.5 26 26.5 25 25.5 26 26.5 25 25.5 26 26.5 25 25.5 26 26.5 

™lim; rest-rrnme ~1560A ™ lim; rest-frame ~1560i ™ lim; rest-rrame ~1560A ^ lim; real-frame ~I560A 

Fig. 7. Dependence of the correlation length, the correlation function slope, and the galaxy bias parameter on redshift, UV- 
luminosity, and the assumed redshift distribution. The solid lines and the crosses correspond to the M-dropouts, the dotted lines 
and the triangles to the ^-dropouts, and the dashed lines and the squares to the r-dropouts, respectively. We scale the r- and /-band 
limiting magnitudes of the u- and g-dropouts to the z-band limits of the r-dropouts as described in the text. 
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4.3. Halo model 



As detected for g-dropouts (jOuchi et al.l2005[ |Lee et al.|2006| l 
and f/-dropouts ( Hildebrandt et al.||2007| l before, we see a sig- 
nificant deviation of the angular correlation function on small 
scales from the power-law behaviour on large scales for all three 
dropout samples. Especially, for the first time we detect this 
one-halo term contribution to the correlation function in our r- 
dropout sample at redshifts considerably larger than z - 4-. This 
allows us to fit a halo model to our data and estimate mean DM 
halo masses and occupation numbers (i.e. the mean number of 
galaxies hosted by a halo) for the different subsamples. 

We apply the halo model with the same parameters as 



[Ham ana et al. (2004) which was used in several high-z cluster- 
ing studies before (e.g. in Ouchi et al. [2005, Hildebrandt et al 



2007 



et al. 



More advanced halo models are available (see e.g. Lee 



2008\ but we decided to stick to the more simple one in 



order to keep our results comparable to older measurements. We 
only show the case for the BCsim redshift distributions since the 
dependence of halo masses on the assumed redshift distribution 
is similar to the dependence of the correlation lengths and the 
galaxy bias values (see Sect. 4.2 1. 

We fit for the mean galaxy density, rig, and the angular cor- 
relation function simultaneously. The mean galaxy density (also 
tabulated in Table [4]| is estimated from the observed number of 
objects divided by the survey volume, the latter being estimated 
from the unmasked area and the redshift distribution. The error 
of the mean galaxy density is estimated from Poissonian vari- 
ance of the galaxy numbers and from assuming a 10% error on 
the survey volume. We choose the angular fitting range for the 
halo models to 0:02 < 6* < 10'. 

The inferred mean halo masses and mean halo occupation 
numbers for the different flux-limited subsamples are also listed 
in Table [4] and displayed in Fig. [8] Errors are again estimated 
from Monte-Carlo realisations assuming uncorrected Gaussian 
errors on the halo model parameters Mmin, Mi, and a, and we 
add the errors derived from the fluctuations introduced by the 
binning of the data. 

The halo masses for the three redshift samples are within er- 
rors nearly consistent with each other at fixed luminosity. While 
we see again a trend in the halo masses with luminosity in all 
three samples, a significant redshift dependence is not observed. 
Our data seem to favour the observation also made before by 
Hildebrandt et al. ( 2QQ7\ that lower redshift LBGs are hosted by 
more massive halos than higher redshift LBGs of the same rest- 
frame UV-luminosity. If this trend is real, the star-formation rate 
per unit mass decreases with redshift - at least in LBGs. This 
result is, however, highly dependent on the assumed redshift dis- 
tribution. Taking e.g. the BPZ redshift distribution would further 
wash-out this already weak redshift evolution. 

The mean halo occupation numbers are not very well con- 
strained by the fit to the data but the trend that ^ A^^^ < 1 for 
LBGs (as observed before by |Ouchi et al.|2 005'and Hild ebrandt] 
et al. 2007) is confirmed. The large errors seem to originate from 
some tension between the observed angular correlation functions 
and the observed number densities]^ This leads to only weak 
constrains on the Mj parameter which is important to calculate 
(NgY whereas it does not influence the estimate of the (Mhaio)- 
A more elaborate method of calculating the errors on (A^g) drop- 
ping the assumptions of Gaussianity and uncorrelated errors of 



^ See alsol Quadri et al. 1 2008| > for a more severe case of such a dis- 
crepancy between the shape of the correlation function and the number 
density observed with distant red galaxies. 



u -dropouts 
-dropouts 
r -dropouts 




26.5 



lim; rest-frame "-ISGOA 



Fig. 8. Dependence of the mean halo mass and the mean halo 
occupation number on redshift, UV-luminosity for the BCsim 
redshift distribution. Symbols and scalings same as Fig. [7] 



the halo model parameters could potentially decrease uncertain- 
ties here. 

Together with the observations of the strong one-halo term 
these occupation numbers mean that most halos of the mean 
mass reported here are devoid of LBGs and only a small number 
of them host a galaxy or a small group. 

5. Conclusions 

We carried out a precision analysis of LBG clustering at differ- 
ent redshifts from extremely deep optical imaging data in the 
CFHTLS-Deep fields. The LBG samples presented here are by 
far the largest studied so far We detect a clear evolution of the 
large-scale galaxy bias and a tentative evolution of the corre- 
lation lengths with redshift for three of the four redshift distri- 
butions considered here. Higher-z LBGs cluster more strongly 
than lower-z ones. In agreement with former studies, we ob- 
serve a strong trend with luminosity with brighter LBGs show- 
ing larger correlation lengths and galaxy bias values than fainter 
ones. Since the samples were selected from one dataset and the 
whole clustering analysis was done in a coherent way we can be 
confident that the relative comparisons are more accurate than 
comparisons carried out before which involved data from differ- 
ent sources. Also the absolute values of the correlation lengths, 
bias values, and slopes of the correlation functions are in good 
agreement to former studies considering the systematic difficul- 
ties arising from a redshift distribution based on simulations and 
the possible contribution of contaminants to the clustering signal 
(see also [Stanway et al. 12008] for a detailed discussion of this 
aspect). 

The common picture of LBGs forming in massive halos 
of M S IO'^/i^'Mq is supported by our halo model analysis. 
Predictions from models of galaxy formation can be compared 
to ever more detailed observational results in this way. By de- 
tecting a significant one-halo term in the correlation function of 
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r-dropouts for the first time, we have observational evidence for 
muhiple galaxies hosted by single DM halos at very high red- 
shifts. 

In order to take this method to even higher precision and 
constrain models of galaxy formation and evolution even better 
the systematic errors must be better understood and controlled. 
A spectroscopic redshift distribution of representative subsam- 
ples is inevitable for reaching this goal. On the theoretical side 
one must think about more realistic models for galaxy clustering 
since this particular halo model used here is not a very good fit 



to the data, leaving considerable residuals. |Lee et aL (20081 in- 
troduce interesting extensions to the halo model used here. We 
plan to incorporate information on the luminosity functions of 
the LBGs in a future study to learn more about the relation- 
ship between the star formation and the DM structure at high 
redshift. However, here we stuck to the simpler model to keep 
results comparable to older studies. 
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Appendix A: Cross-correlation test of photo-z's 



Similar to |Erben et al.| ( [2009] l we estimate the cross-correlation 
function of galaxies in photo-z slices. In this way the null hy- 
pothesis of non-overlapping slices, which are far apart in red- 
shift, can be tested. See |Erben et al.| ( [2009[ ) for a detailed descrip- 



tion of the technique. In Fig. A.l the auto- and cross-correlation 
functions of galaxies with / < 26.5 in many different photo-z 
slices are shown. The fact that nearly all cross-correlation func- 
tions of non-neighbouring photo-z slices show an amplitude con- 
sistent or very close to zero is a strong argument for the robust- 
ness of the photo-z's. No excessive overlap between low- and 
high-z slices is observed. Thus, even for fainter magnitudes, the 
outlier rates are under control. A more quantitative analysis of 
this method will be presented in a forthcoming paper (Benjamin 
et al. 2009 in prep.). 
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Fig.A.l. Auto- (blue on the diagonal) and cross-correlation (red off-diagonal) functions of galaxies with / < 26.5 in different 
photo-z slices. The blow-up in the bottom left is just an example that illustrates the ranges of the axis. 



